Biomedclip PubMedBERT 256 Vit Base Patch16 224
MIT
BiomedCLIP is a biomedical vision-language foundation model pre-trained via contrastive learning on the PMC-15M dataset, supporting cross-modal retrieval, image classification, visual question answering, and other tasks.